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INTRODUCTION 


Domain decomposition techniques have become a popular way to numerically 
solve partial differential equations, especially when geometric complexity 
needs to be addressed [1,2]. The introduction of parallel computers seems 
naturally suited for these techniques because each separate domain can be 
assigned to its own processor. These techniques are especially appropriate 
for use with pseudospectral methods due to both the global nature of the spec- 
tral derivative matrices and the high condition number of these matrices as 
the number of collocation points increases. 

However, the interconnection of the individual domains, and the method of 
solution of the overall numerical problem have not yet taken advantage of the 
parallelism inherent in multi-domain techniques [3]. In the present work, we 
investigate interface boundary conditions and show that a judicious choice of 
these conditions leads naturally to an algorithm which has a high degree of 
concurrency, even when pseudospectral methods are applied to each of the sub- 
domains . 

Several methods will be presented for a model elliptic problem. Each of 
these relies upon the properties of spectral methods for its applicability to 
domain decomposition techniques. For a simple two domain problem, a novel 
concurrent forward/backward elimination procedure is used. In order to 
generalize this result to three or more domains, a different interface 
boundary condition, that makes use of the special properties of spectral 
methods, is introduced. These procedures are also discussed in relation to 
influence matrix techniques. 
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SECTION I 

Certain features of pseud ospectral methods, relevant to the issue of 
parallel computing, can be identified by considering the pseudospectral 
Legendre (PSL) discretization of the simple equation 

u xx = f(x) -1 x _< 1 (1.1) 

with the boundary conditions 


a u(-l ) + Bu (-1 ) = b 
x 1 

Y u(l ) + & U X (D = b 2 * 


( 1 . 2 ) 


Let Xj j = 0,»»»,N be defined as follows 


Xq = -1, x N = +1, and x ^ , j = 1,»»«,N-1 are the extrema of p N , i .e . , 


W -° 


( 1 . 2 ) 


where p^ is the Nth order Legendre polynomial 


In the PSL discretization, one seeks a polynomial u N , of degree N, such that 


.2 
3 u 


N 


3x‘ 


2 “ f(x) at x = Xj j = !,•••, N-l 


(a) 


3u 


N 


« u„(-l) + 8^ (-1) - bj 


(b) (1.3) 


3u iu 

Y U N (I) + { JT (1) " V 


(c) 


In practice we express (1*3) as a linear system of equations for the grid 
point values u N (xj). In fact u^(x), as an Nth order polynomial, is 

identical with its Nth order interpolation polynomial, thus 


N 


U N <x) “ J. 

k=0 


(1.4) 


where g k (x) are the Lagrange interpolation polynomials 


g t ( *> 


(l-x z )P^(x) 


X- 


\ 


P N (x k )N(N+l) 


(1.5) 


such that g^(xj) = 6^. 


Substituting (1.4) into (1.3) one gets 


N 


l g^'(Xj> u N ( x k^ = f < x j) 3 = 1»***»N-1 


N 


l {0t6 kO + K (X 0 )}U N ( \ ) " b l 
k=0 


N 


J. tTS Nk + Sg k ( V , V\ > ' b 2 ' 

k=0 


(a) 


(b) 


(c) 


( 1 . 6 ) 


The system of linear equations (1.6) is the PSL discretization of (1.1)(1.2). 

It should be noted that all other pseudospectral methods can be expressed 
in the same way, the only difference being the definition of the collocation 
points and consequently the Lagrange interpolant gfc(x). For example, in the 
pseudospectral Chebyshev method, Xj are the extrema of 
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T n (x) = cos (Ncos -1 x) and g k (x) is given by 


2 V x) 1 

gk(X> " (1 " X } X ‘ X k ' N 2 * 


(1.5a) 


c 0 = c n = 01 c j = 1 0 * J * N * 


For the PSL method it has been shown [4] that 


g"( Xj ) - 


W ( Xj -x k ) 2 
C ( V -- 17-2 ■ 


1 , • • • , N-l 


(1.7) 


Thus the first important feature of pseudospectral methods follows the obser- 
vation that the system in (1.6) is full . This is true for all spectral 
methods; the fact that (1.4) is a global interpolation results in full 
matrices approximating derivatives. This is in contrast to local methods like 
finite differences in which the matrices are banded. 

The other important feature of pseudospectral methods is a consequence of 
the fact that the collocation points Xj are the nodes of the Gauss-Lobatto 
quadrature formula. It is known that the Gauss-Lobatto quadrature formula is 
exact for every polynomial h(x) of degree at most 2N-1. Thus there exists 
positive weights w^ , • • • , w^ such that 


1 I’ 

/ h(x)dx = l h( x, )w, , 

-1 j=0 3 3 


i.e., the spectral sum is exactly the integral. 
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In particular if f^j is the interpolant of f at the points Xj 

1 N 

/ f (x)dx = l f(x,)w . (1.9) 

-1 j=0 J J 

The significance of the above mentioned properties of spectral methods is one 
of the main themes of this work. The fact that pseudospectral discretizations 
result in full matrices creates enormous difficulties in the application of 
spectral methods. We will attempt to show that one can use (1.8) in order to 
modify this difficulty, in certain circumstances, with the use of parallel 
architecture . 

SECTION II 

Multidomain spectral techniques have been extensively used by practi- 
tioners since the beginning of this decade. In many applications these meth- 
ods were proved to be superior to one domain spectral techniques allowing more j 

flexibility in the geometry of the problem as well as easier inversion of the 
matrices appearing in spectral methods. 

| 

We refer the reader to the review article [1] for a detailed discussion 

i 

i 

on the merits of multidomain spectral techniques. Here we would like to 
analyze these methods from the point of view of parallel computing; the cri- 
terion that we suggest is whether a multidomain technique can be cast as a set 
of disjoint problems, such that each problem can be solved in a different ! 

processor concurrently. i 

i 

We start by analyzing a variety of two domain PSL methods applied to 

i 

equation (1.3). The domain [-1,1] is being divided into two domains [-1,0] 
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and [0,1] so that we have to discretize a system of two equations 


a 2 I t 

— 5— = f (x) -1 < x < 0 (a) 

9x 

( 2 . 1 ) 

a 2 11 tt 

~r— = f (x) 0 X £ 1 (b) 

3x 


with the boundary conditions 

<*u I (-l ) + 3 (-1) = bj (a) 

tt a 11 

yu (1) + 6 (1) = b 2 * (b) 


( 2 . 2 ) 


The PSL method is now applied separately to (2.1a), (2.2a), (2.1b), and 
(2.2b). It is clear however that two more conditions should be given in order 
to make (2.1) and (2.2) consistent with (1.3). The first condition is the 
same for all multidomain techniques, this is the continuity condition 


u X (0) = u IT (0). 


(2.3) 


It is the choice of the second condition that distinguishes between the multi- 
tude of multidomain techniques. The simplest possibility is to impose the 
condition for the continuity of derivatives at the point x - 0 

3r (0) -Tr (0) ‘ (2 - A) 


It is interesting to note that for 3=6=0 in (2.2) one can diagonalize 
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the system (2*1) with the conditions (2.2), (2.3), (2.4) and thus present 

(2.1) as a system of two equations that are completely decoupled. In fact 
upon defining 


R(x) 


U T (X) + u n (-x) 
2 


and 


S(x) 


u T (x) - u IT (-x) 
2 


one gets the following system 

i [ f T ( x) + f n (-x)l 

-1 < x < 0 (a) (2.5) 

J [f T (x) - f n (-x)l 

(b) 

S(-1) = J [— - S(0) - 0. 

This is in fact the same way that was used in [5] to show that the eigenvalues 
of the two-domain second derivative matrices are real negative and distinct. 
We believe that the ability to diagonalize the system of equations resulting 
from a particular multidomain technique is an indicator for parallelizing the 
solution of the equations resulting from the discretization of the original 
system. A more detailed discussion of this connection will be presented in a 
future paper. 

We turn now to the PSL method for the discretization of (2.1 )— (2.4). Let 
the collocation points for the first domain x^, j - 1,***,N-1 be defined 
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by 


I 

9 jT (2 X j + 


(a) 


note that 


-1 _< 0, and let x** be defined by 


9p N II .v 
W (2x j " ° 


(b) 


xj = -1, = 0, Xq* = 0, and x ** =1. We assume here for simplicity that 

the number of points at each domain is the same but of course this is not 
essential • 

I 


Denote by (uj,***,u^) the values of 


u 


N 


at the collocation 


points x^ and denote by (uq* , • • • ,u**) the values of u**. Equation 
(2.3) means that 


I II 
U N ” U 0 


and we denote this by w. 

Equation (2.4) leads to 


i *k ( *N )u k ’ J 


II s II 


( 2 . 6 ) 


Applying now the method described in (1.6) to discretize (2.1) and using (2.2) 
we conclude the PSL discretization for the two domain method can be described 
as follows 


! 



-9- 



(2.7) 


where A^, A 22 are N*N matrices, h^j are N-component vectors and a is 
a scalar [6]. For convenience we define 



As an example, we write down the case N = 4. The x denotes a nonzero 


element 
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r- I -1 

u o 


' b i" 

I 

U 1 



I 

U 2 



I 

U 3 



w 

= 

0 

II 

U 1 



II 

U 2 



II 

U 3 



II 

L u 4 - 


- b 2 - 


From the structure (2.7) and (2.8), it can be seen how the solution of (2.7) 
can be carried out in two processors in parallel. In the first processor we 
use the Gaussian elimination for the matrix 



to get to an upper triangular matrix; concurrently the matrix 
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ls being transformed to a lower triangular form in the second processor. The 
system (2.8) is thus being transformed to the form 


U 

0 



(2.9) 


U is an upper triangular NxN matrix and L is lower triangular NxN 
matrix. The value of w is first determined and backward/ forward substitu- 
tion is then carried out, again concurrently within the two processors. 

The procedure described above is strongly related to the fact that the 
system (2.1)- (2. 4) can be diagonalized, however it is more general since even 
if (2.1 )— (2.4) cannot be diagonalized the parallel inversion of (2.8) can 
still be carried out. For example if 8 or 6 are different from zero 

in (2.2) or if (2.1) is of the form (a(x)u x ) x = f, diagonalization is 

impossible but the form (2.7) is still the same and the parallel solution of 
(2.8) is the same. 

A different approach to the solution of (2.7) is to rewrite the system 
explicitly 


A^u 1 + h J2 w = f 1 

(a) 


A^u 11 + h 32 „ - f 11 

(b) 

(2.10) 

hjj • u 1 + aw + h^u** = 0. 

(c) 


I .-1,1 A -l. 

u A n f w A n h 12 

(a) 
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II - A" 1«H A-k 

u ^22* ^ z 2^32 

(b) 

(2.11) 


.T .-1,1 .T .-1,11 

h 2 1 A 1 1 f + h 23 A 22 f 

w * _ h T A _1 h b T rK * 

(c) 



h 21 A ll h 12 h 23 A 22 h 32 



The system 

(2.11) seems to be, at first glance, more 

complicated 

than i t 

really is. 

In fact, let P}» P2» P 39 P 4 ^ N-component 

vectors such 

that 


A n p i = fI 

(a) 



A 1 1 P 2 = h 12 

(b) 

(2.12) 


a 22 P 3 = fI1 

(c) 



A 22 p 4 = h 32* 

(d) 



Then equation (2.11) can be written as 

h 21»l + h 23 T 3 
w 

a - h 21 p 2 - h 21 p 4 
I 

u - P x - wp 2 
II 

u - P 3 - wp 4 . 


(2.13) 


It is self evident now that a part of the solution procedure can be carried 
out in parallel. The vectors P 1> ,,, ,P 4 can be computed in parallel in four 
(or two) different processors. The inner products h 21 P l’ h 23 p 3’ h 21 P 2’ 
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T 

and ^21^3 Can a S a * n done in parallel. Equation (2.13) is basically 
gathering information from the processors to the shared memory. It seems that 
the procedure outlined in (2.9) is more efficient than (2. 12 )-(2. 13 ) ; however 
(2. 12)-(2. 13) can be easily generalized to more than two domains. 

The procedure outlined in (2.12 )— ( 2 . 13) has a very interesting interpre- 
tation on the differential equation level. In fact equation (2.12a) is a dis- 
cretized version of the equation 


with 



-1 < x < 0 


(2.14) 


3q l 

a q ^ (— 1 ) + B (-1) - b 1 q^O) - 0 


whereas (2,12b) is the PSL discretization of 


» *2 

=0 -1 < x < 0 

3x 

aq 2 

a q 2 (-l) + B jf- (-D = 0 q 2 (0) = -1. 


Similarily (2.12c) and (2.12d) are the discretization of 


and 



3< h 

q 3 (0) * 0 Yq 3 (l) + 5 (1) “ l»2 



(2.15) 


(2.16) 


(2.17) 
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9 9 4 

q 4 (0) = -1 Yq 4 (l) + 6 JT (1) = 0> 

It is well-known that the general solution to (2.1a) and (2.2a) is 

uhx) = q 1 - wq 2 « (2. 18) 

The continuity condition 

u T (0) = u TI (0) 

implies that 

u IT (x) = q 3 - wq 4 (2.19) 

and (2.13a) is the manifestation of the condition (2.4). 

The advantage of the formulation (2.14 )— ( 2 . 19) is that it lends itself 
easily to generalization without getting into the details of the discretiza- 
tion. 

We conclude by summarizing the main results in this section: 

1. We showed that for Dirichlet boundary conditions, the system (2.1) and 
(2.2) can be diagonalized and argued that each one of the equations can be 
solved in a different processor. 

2. We presented a general technique to solve concurrently the system (2.7) by 
assigning each separate domain to a separate processor [e.g., (2.9)]. 

3. We presented a different procedure (related to the influence matrix 
method) and gave an interpretation on the level of the differential 


equations 
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In Section IV, we will discuss methods 2 and 3 in the general case of more 
than two domains. In the next section, we generalize method 1 to a two dimen- 
sional equation. 


SECTION III 

The extension of the techniques mentioned in Section II to the two dimen- 
sional Poisson equation seems to be obvious. However, there are some dif- 
ferences in implementation. The two dimensional extension of (1.1) is the the 

equatl0n -1 < X < 1 

u + u “ f(x,y) . (3.1) 

XX yy -1 < y < 1 

Here we will discuss only Dirichlet boundary conditions of the form 


u(x,-l) = bjCx) 
u(x,l) = b 2 (x) 


u(-l,y) = b 3 (y) 
u(l,y) = b 4 (y). 


(3.2) 


The original domain is divided now into four domains 

Bj - {-1 < x < 0, -1 < y < 0} ^ " {0 < * < 1 , -1 < y < 0} 


£2 3 = (-1 < x < 0, 0 < y < 1} n 4 " (0 i * < 1. 0 < y < 1). 


(3.3) 
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T— 1 

cz 

1 


It Is readily verified that (3.1)(3.2) can be factored into four independent 
problems if one imposes continuity of the function and normal derivative at 
interfaces . 

In fact denote by u 1 , u 11 , u 111 , u IV the solution u(x,y) in 
The interface conditions amount to 

u T (0,y) = u n (0,y) -1 < y < 0; u Ii:i (0,y) = u IV (0,y) 0 < y < 1 

(3.5) 

I , II a IV a IV 

3y (o>y) ■ -*fy (°»y) “gy (°»y) = Ty“ (°>y) 

u I (x,0) = u I1:I (x,0) u Ii; (x,0) - u IV (x,0) 

-1 < x < 0 0 < x < 1 . (3>6) 

3u 1 (x > 0) _ 3u I11 (x > 0) 3u II (x,0) _ 3u IV (x,0) 

3x 3x 3x 3x 


We define now 


Rj(x,y) * u X (x,y) + u Ii: (-x,y) + u IIi: (x,-y) + u IV (-x,-y) 
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R 2 (x,y) = u*(x,y) - u i:t (-x,y) + u III (x,-y) - u IV (-x,y) 

R 3 (x,y) = u T (x,y) + u IX (-x,y) - u 111 ^,-^) - u IV (-x,-y) 

R 4 (x,y) = u X (x ,y) - u 11 (-x,y) - u IIT (x,-y) + u IV (-x,-y) 

The functions Rj, R 2 , R 3 , R 4 satisfy trivially the Equation (' 

defined "in (3.4). Moreover 


RjC-l.y) = b 3 (y) + b^(y) + b 3 (-y) + b^(-y) 
R 2 (_ 1 »y ) = b 3 (y) - b^(y) + b 3 (-y) - b^C-y) 
R 3 (-l»y) = b 3 (y) + b^(y) - b 3 (-y) - b^(-y) 


R 4 (“l.y) = b 3 (y) - b 4 (y) - b 3 (-y) + b^C-y). 


In the same way we can express each R^Cx.-l) i * ].,•••, 4 as 
of b^(±x) and b 2 (±x). Moreover, e.g., (3.5) yields 


R 2 (0,y) = R 4 (0,y) = 0 


3R, 3R, 

w <0,y) - (0,y) - 0 


(3.7) 


. 1 ) in fij 


(3.8) 


combinations 


(3.9) 


and, e.g., (3.6) yields 
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R 3 (x,0) 



(x,0) 


= R 4 (x,0) = 0 

a R i 

= TT (x ’ 0) = °* 


( 3 . 10 ) 


Thus, RjCx.y), R 2 (x,y), R 3 (x,y), R^x.y) can be solved concurrently. 
To get back u*,» ## ,u * one concludes from ( 3 . 7 ) 


-1 < x < 0 

u X (x,y) = t- {R, (x,y) + R.(x,y) + R-(x,y) + R,(x,y)} 

4 -1 < y < 0 


0 < x < i 

u IX (x,y) = 7- {R, (~x,y) - R_(-x,y) + R„(-x,y) - R, (-x,y)} 

Z -1 < y < 0 


u IIX (x,y) = -7- {R, (x,-y) + R«(x,-y) - R-(x,-y) - R, (x,-y)} 

0 < y < 1 


0 < x < i 

u IV (x,y) = x {R.(-x,-y) - R„(-x,-y) - R_(-x,-y) + R,(-x,-y)} 

41 Z 0 <y< 1 

Equation ( 3 . 7 ) shows a way to fully parallelize the solution of ( 3 . 1 ) using 
four different processors. This is an extension of ( 2 . 5 ) which we adopted as 
an indicator for possible parallelism. Investigations into the choice of 
interface conditions leading to multidomain techniques that can easily be 
parallelized will be reported in a future paper. 



SECTION IV 


We return to Equation (1.1) to discuss multidomain techniques with more 
than two domains. We will discuss also a modification of (1.1) in the form 

(h(u,u x )) x = f ( x) -I _< x _< 1 (4.1) 

h(u,u ) = b- at x = -1 
x 1 

u = at x = 1. 


Actually this is the form that appears most often in applications. 

Nothing essential is lost if we restrict ourselves to three domains and 
for simplicity we will discuss this case only. Generalization to more domains 
are trivial. 

We start by showing that our indicator for parallelism shows that impos- 
ing continuity of u and as an interface condition for (1.1) (or con- 

tinuity of u and h(u,u x ) in the case of (4.1)) leads to a method that can- 
not be factorized into disjoint problems. Let u 1 , u 11 , U I1I be the 

solutions of 

U 1 = f 1 -1 < X < 5, 

XX — — 1 


II -II 
u = f 

XX 


III ,111 
u * f 

XX 


< X < 5 2 


S 2 < x < i 


(4.2) 


with u^(-l) - u ITI (l) = and the interface condition 
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u T (C 1 ) = u II (C 1 ) 


TT« l>-TT-<*i> 


II,, x III,, x 
u (S 2 ) ■ u (C 2 ) 


, II . Ill 

3u (f ) = in (f ) 

3x ''V 3x '^2 ' * 


(4.3) 


As in (2.5) we map 


[-1 1 + [g 2 .Sj] and [C 2 ,l] t^,^] 


~I ~II ~III 

and denote the new dependent variables u , u , u . We have now 


~I ~I 
u = f 

XX 


with 


~II -211 
u = f 

XX 


X e [Cj ,5 2 1 


(4.4) 


~III 

u 

XX 


fin 


u I (5 2 ) 


~III,, X . 

u <«i> " b 2 


(4.5) 


^vT <vT T ~T T T T T 

u(5j) - u(5j) u (5 2 ) = u (5 2 ) 


9u I (C 1 ) 3u II (5 1 ) 3 u XI (5 2 ) 3u TII (5 2 ) 


3x 


3x 


3x 


9x 


We seek now a combination of the form 


~I ~tti ~T T T 
R = Au + Bu + Cu 


(4.6) 
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that will satisfy the differential equation at 

~I 


1 x - ^2 


II ~III 

y 


with boundary 

conditions that are independent of u 4 *, u 4 ’ 4 ', u 4 ' 4 * 4 '. In fact we need three 
different values of A, B, C to decouple the problem completely. However, it 
is easily verified that there is only one quantity that is decoupled. Namely 


-I . -II -III 
-u + u - u 


(4.7) 


is the only quantity that can be computed separately from the other two. 

This conclusion becomes more apparent when we observe the structure of 
the PSL method for three domains. In analogy to (2.7) we get the system 


11 

T 

21 


12 


31 


23 

22 

T 

41 


32 


‘51 


42 


33 


fin 

u 


1 

M 

1 

W 1 


0 

II 

u 


f 11 

W 2 


0 

III 

u 


_f m _ 


(4.8) 


where are NxN matrices h^j are N component vectors and a^ are 

scalars. The unknowns are 


I , I I . II II II. Ill , III III. 

u = (u 0 »* ** » U N _ 1 ) » u - u l > -*,u N ), u = (u x ,---,u N ) 


Wj stands for u* ■ uj* and W 2 stands for u ^+i = u q^* 

We write down explicitly the case N * 3, the x denotes nonzero 


elements 
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(4.9) 


It is clear from (4.5) that we can carry out Gaussian elimination in parallel 
only for two blocks, the first is (2N+2)x(2N+2) and the second 
(N+l)x(N+l), so at this point nothing has been gained from using three 

domains. This corresponds to the observation that there is only one quantity 
that is independent of the others. It seems that the method outlined in (2.9) 
does not carry over to more than two domains. We will show later that a 
judicious choice of the interface boundary condition can lead to a separable 
set of equations. 

The method outlined in (2. 10)-(2. 12) is still valid. We first define 
p*, p 11 , P 111 , P 21 , P22> p 32» P 31 to k e the so l ution vectors of 
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a pi = fl A P = h 

A 11 A 11 12 12 


(a) 


A 22 P f A 22 P 21 “ h 31 A 22 P 22 ~ h 32 ^ C^* 10 ) 


A pi 11 = f m a P = h . 

*33 1 *33 31 n 51 


(c) 


It Is readily verified that if Wj, W 2 are the solution of the system 


w l [a l h 21 P 12 " h 23 P 21 1 + W 2 [a 4 h 23 P 22 1 h 21 P h 23 P 


r .T m - r ,T „ T . T II T III 

W 1 a 3 h 41 P 21 + w 2 a 2 ~ h 41 P 22 h 42 ? 31 1 h 41 P h 41 P 


(4.11) 


then 


n I 

P “ W 1 P 12 


II »H r, 

u = p - Wj P 21 - w 2 P 22 


III _III _ 
u =P -w 2 P 31 . 


(4.12) 


The procedure (4. 10)-(4. 12) shows that one has to solve three equations per 
inner domain, c.f. (4.10b), and two equations per boundary domain; (4.10a,c) 
this can be done in parallel. Equation (4.11) has then to be solved. The 
dimension of this system is proportional to the number of domains. 

The method described in (4. 10 )- (4. 12) has, again, an interpretation on 
the differential equation level. The vectors P*, P**, pIU are the approx- 
imations to the Equation (1.1) (or (4.17) with homogeneous boundary conditions 
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in each domain). The vectors are the approximations to the solutions of 
the homogeneous problem with homogeneous boundary conditions at one side and 
value -1 at the other side. Equation (4.11) expresses the interface condi- 
tions . 

We claim now that by a better choice of the interface condition one can 
greatly simplify the solution procedure. In fact integrating (4.1) from -1 
to 5 j one gets 

5 1 

h(u,u x )(x = Cj) - h(u,u x )(x = -1) = / fdx. (4.13) 


h(u,u x )(x = 5^) = bj + / fdx. (4.14) 


We note the exact result, that within the context of the PSL method 

5 1 N 

/ (P f)dx = l f(x. )w 
1 j=0 - 1 3 

and therefore the integral is known explicitly. We therefore replace (4.2) 
and (4.3) by 

h(u I ,u I ) = f 1 -1 < x < 

xx — — 1 


. / II IIv rll 

h(u ,u ) = f 

9 X X 


h < x < *2 


, . Ill III. .Ill - . . . 

Mu ,U X >X = f 5 2 < x < 1 


(4.15) 


with interface conditions given as 
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! 

I ly. x Hf c X 

U (? 1 ) = u (?j ) 


II 

u 


(C 2 ) = u ITI (5 2 ) 


(4.16) 


h(u II ',u“)(x = 5j) = bj+ I h(u III ,u^ II )x(x = 5 2 ) = bj+ I fhx^Wj 


+ I fII ^ x j^ w ,i* 


When (4.15) and (4.16) are used, we get basically the system described in 

(4.8) with hjj = 0, a^ = anc * h 4i = 

With this notation we note first that the system (4.11) is bidiagonal . 
Alternatively, one can transform A u and A 22 to an upper diagonal form and 
the lower corner structure in (4.8), i.e., 



L h 51 A 33 J 


to a lower diagonal form and then solve for w 2 and wj» Backward substitu- 
tion will then be done concurrently. 

We conclude this section by emphasizing that interface boundary condi- 
tions have a great impact on the ability to carry out the solution procedure 
in parallel. 


ACKNOWLEDGEMENT 

The authors are indebted to Professor Saul Abarbanel for his help in 
proving the result in Section III. 

i 

f 


! 



-26- 


1EFERENCES 


1. A. Quarteroni: "Domain decomposition algorithms using spectral methods," 

to appear in Calcolo . 

2. A. T. Patera: "A spectral element method for fluid dynamics," J. Comp. 

Physics , Vol. 54, 1984, pp. 468-488. 

3. R. C. Y. Chin: "Asymptotics and domain decomposition in parallel comput- 

ing," presented at the Third SIAM Conf . on Parallel Processing, Los 
Angeles, December 1987. 

4. D. Gottlieb, M. Y. Hussaini, and S. A. Orszag: "Theory and application of 

spectral methods," in Spectral Methods for Partial Differential Equations , 
SIAM, 1984. 

5. D. Gottlieb and L. Lustraan: "The spectrum of the Chebyshev collocation 

operator for the heat equation," SIAM J. Num. Anal., Vol. 20, 1983, pp. 
909-921. 

6. H. C. Ku, R. S. Hirsh, and T. D. Taylor: "A C(0) pseudospectral element 

method for the solution of the Navier-Stokes equations in primitive vari- 
ables," presented at the First ICIAM Conference, Paris, June 1987. 



NASA-Langley, 1988 






